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Abstract 

Inspired by the recent literature on aggregation theory, we aim at relating the long range 
correlation of the stocks return volatility to the heterogeneity of the investors' expectations 
about the level of the future volatility. Based on a semi-parametric model of investors' an- 
ticipations, we make the connection between the distributional properties of the heterogene- 
ity parameters and the auto-covariance/auto-correlation functions of the realized volatility. 
We report different behaviors, or change of convention, whose observation depends on 
the market phase under consideration. In particular, we report and justify the fact that the 
volatility exhibits significantly longer memory during the phases of speculative bubble than 
during the phase of recovery following the collapse of a speculative bubble. 

JEL classification: GIO, G14, D84, C43, C53 
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Introduction 

The slow hyperbolic decay of the auto-correlation function characterizing the behavior of many 
economic and financial time series has been the topic of active debates for more than two 
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Forecasting Financial Markets and at the 23''* International Conference of the French Finance Association. 
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decades. Several interpretations and models have been provided in an attempt to explain the 
origin of this phenomenon, also known as the long-memory phenomenon. Among the most 
relevant explanations that have been proposed up to now, one can focus on three major mech- 
anisms acting separately or in conjunction: (i) the aggregation approach suggested by Granger 
and Joyeux (1980) who have shown that the time series resulting from the aggregation of 
micro-variables exhibiting short-memory often yields long-memory, (ii) the presence of infre- 
quent structural breaks, which allows mimicking long term non-stationarity of the economic 
and financial activity (Diebold and Inoue 2001, Gourieroux and Jasiak 2001, Granger and 
Hyung 2004, Gadea, Sabate, and Serrano 2004, among others), and (iii) the presence of non- 
linearities in economic an financial systems (see Davidson and Sibbersten (2005) for a survey). 

Our aim, in this article, is to provide a model that relates the long memory of the realized 
volatility of assets returns, which is a pervasive feature of financial time series (Taylor 1986, 
Ding, Granger, and Engle 1993, Dacoragna, Miiller, Nagler, Olsen, and Pictet 1993, for the 
pioneering works), to the heterogeneous behavior of the economic agents. Based on the fact 
that the market participants perform heterogeneous anticipations about the future level of the 
realized volatility and that they act as bounded rationality agents, we propose an explanation of 
the long memory phenomenon that relies on the aggregation by the market of the heterogeneous 
beliefs of the investors, revealed through the market pricing process. 

The heterogeneity of market investors is now an well-recognized fact - in particular amongst 
the supporters of behavioral finance (see Barberis and Thaller (2003) for a survey) - that can 
take several forms. Heterogeneity can first be considered as resulting from the diversity of the 
nature, the size and strategies of the economic agents: individual investors who invest their 
money to finance the education of their children, traders who manage money for their own ac- 
count or for the account of their clients, institutional investors who manage pension funds, and 
so on... do not have the same financial resources (depending on their size), the same purposes 
(short term or long term profits and allocation frequency for example), or the same skills. All 
these differences make them focusing on different pieces of information and therefore antici- 
pating differently the future value of the firms. 

In this respect, Di Matteo, Aste, and Dacorogna (2005) have recently provided evidence for 
the existence of a relation between the degree of heterogeneity amongst the market participants 
and the stage of development of financial markets. Indeed, relating the stage of maturity of 
a financial market to the speed of the hyperbolic decay of the auto-correlation function of the 
volatility of the assets traded on the market under consideration, Di Matteo et al. suggest that 
the more mature and efficient the market is, the larger is the number of different classes of agents 
and strategies and the smaller is the effect of long-memory. In addition, other phenomena such 
as mass psychology and contagion must be taken into account. Indeed, clear evidence has 
shown that rumors (Banerjee 1993), mimetism (Orlean 1995), herding (Banerjee 1992, Froot, 
Scharfstein, and Stein 1992, Kirman 1993, Cont and Bouchaud 2000), fashions (Shiller 1989) 
and so on, affect the agents' behavior. 

In order to account for these various sources of heterogeneity but still keep a parsimonious 
representation, we provide a model that relates the investors' behavior to few heterogeneity 
parameters that allow accounting for their individual tendency to perform optimal anticipations 
on the basis of the flow of incoming news they receive or, on the contrary, on the basis of a self- 
referential approach which leads them to mainly focus on their past anticipations and on their 
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past observations of the market volatility. This approach permits us to focus on the fact that, in 
addition to the heterogeneity of sizes and strategies, another main source of heterogeneity comes 
from the way the agents actually anticipate the many factors which impact future earnings of the 
firm and their volatility. These factors are captured, in our model, by help of a flow of incoming 
public and private information which can be considered as embedding several macroeconomic 
variables affecting stocks volatility such as the business cycles (see Schwert (1989) who has 
found a higher volatility of many key economic variables during the Great Recession), oil price 
whose volatility is important in explaining technology stock return volatility (Sadorsky 2003), 
or inflation and interest rates which have large impacts on the stock market volatility (Kearney 
and Daly 1998). 

Then, we make the connection between the distributional properties of the heterogeneity 
parameters and the auto-covariance/auto-correlation functions of the realized- volatility. It al- 
lows us to discuss the kind of economic behaviors that yields the long memory of the realized- 
volatility time series. Finally, the calibration of our model over the last decade, on a database 
of 24 US stocks of large and middle capitalizations, allows us reconstructing the distribution of 
the heterogeneity parameters and then to have access to the overall behavior of the investors. 
Notably different behaviors are observed, depending on the market phase under consideration 
with (i) a strong tendency to self-referential anticipations before the crash of the Internet bub- 
ble, and (ii) a redistribution in favor of the investors performing their forecasts on the basis the 
incoming piece of information after the crash. 

The paper is organized as follows. The next section briefly recalls some basic stylized 

facts about the so-called realized- volatility, a measure of the volatility introduced by Andersen, 
BoUerslev, Diebold, and Labys (2003) and Barndorff-Nielsen and Shephard (2002a), among 
others. Then in section 2, we present our model of bounded rationality investors with hetero- 
geneous beliefs in order to investigate the impact of both the agents' bounded rationality and 
their heterogeneous beliefs in a market that is assumed to perform an aggregation of the indi- 
vidual anticipations. In section 3, we discuss the calibration issues of the model and derive the 
asymptotic law of the estimator of the parameters of the model. The fourth section present our 
empirical results while the fifth section concludes. 

1 Stylized facts about the realized volatility of asset prices 

Many stylized facts about the volatility of financial asset prices have been reported in several 
studies over the recent years (Ding, Granger, and Engle 1993, Lo 1991, among many others). In 
particular, people now agree on the fact that (i) returns display, at any time scale, a high degree of 
variability which is revealed by the presence of irregular bursts of volatility and (ii) the volatility 
displays a positive auto-correlation over large time lags, which quantifies the fact that high 
(resp. low) volatility events tends to cluster in time, as already reported by Mandelbrot (1963) 
and Fama (1965) who first mentioned evidence that large changes in the price of assets are 
often followed by other large changes, and small changes are often followed by small changes. 
This behavior has also been noticed by several other studies, such as Baillie, BoUerslev, and 
Mikkelsen (1996), Chou (1988) or Schwert (1989) for instance. 

In this section, after we have recalled the definition of the notion of realized volatility, we 
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document some of the common features of the process of the asset price realized volatility and 
relate them to the relevant literature. 



1.1 Definition of tlie realized volatility 

Based upon the quadratic variation theory within a standard frictionless arbitrage-free pricing 
environment, Andersen, BoUerslev, Diebold, and Labys (2003) have suggested a general frame- 
work for the use of high frequency data in the measurement, the modeling and the prediction of 
the daily volatility of asset returns. In fact, as also recalled by Barndorff-Nielsen and Shephard 
(2002a, 2002b), when the underlying asset price process is a semi-martingale, the realized vari- 
ance (the squared realized volatility) provides a consistent estimator of the quadratic variation, 
in the limit of large samples. Indeed, denoting by Pj ^ the i^'^ observation of the asset price dur- 
ing the trading day t and by rj ^ = ln(Pj_i) — ln(Pi_i t) the continuously compounded return on 
the asset under consideration over the period i — 1 to i, the realized variance, at day t, defined 
by 



dt, (1) 



with rit the number of observations during this day, is a consistent estimator of the integrated 
variance of the price process. In addition, as shown by Barndorff-Nielsen and Shephard (2002a), 
the asymptotic properties of this estimator are such that 

-Ar(0,l), and y ^^Ar(0,l), (2) 



IE 



4 / 2 Z^i=l i,t 



where a{t) is the instantaneous (or spot) volatility of the log price process. 

If the estimator ([T]) does not require the time series of asset returns to be homoscedastic 
during day t, it is however assumed that the returns are uncorrected. Therefore, in order to 
account for the market microstructure effects, which may produce spurious correlations (Roll 
1984, for instance), it is often necessary either to filter the raw series of intraday returns to 
remove these correlation^ or to focus on sufficiently large time scales - 5 minutes instead of 1 
minute or tick by tick quotations, for instance - in order to smooth out the microstructure effects 
and correlations. The immediate drawback of this later approach is to decrease the number rit 
of available observations, which may bias the estimates. 



[Insert figure\T\about here] 



As an illustration, we present, on the figure [H the realized volatility for two time series 
drawn from our intraday database of 24 US stocks prices (see section 4 for details on the 
database). On the left panel, we can observe the realized volatility of the daily returns of a 

'See the comparative study in Bollen and Inder (2002) for instance. 
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middle capitalization stock (The Washington Post) during the time period from 01/01/1994 to 
12/31/2003 while, on the right panel, is depicted the realized volatility of a large capitalization 
stock (Coca-Cola) over the same time period. Since our investigation of the time dependence 
between the intraday returns of these asset prices has not revealed the presence of a significant 
correlation beyond the one minute time scale, the market microstucture effect are negligible, 
which allows us to directly estimate the realized volatility from the one-minute raw returns. 

[Insert figure \2\ about here] 



On the figure[2l we have drawn the auto-correlation function of the previous realized volatil- 
ity (still The Washington post on the left and Coca-Cola on the right) from lag to lag 250 days, 
which corresponds to a one year period or so. The slow decay is characteristic of the long mem- 
ory. However, the two graphs seem different. For The Washington Post the auto-correlation 
falls down to 0.4 quickly and then decreases very slowly, whereas for Coca-Cola, it decreases 
more regularly and the auto-correlation becomes not significantly different from zero beyond 
lag 180, or so. This general feature could mean that the correlation function of the volatility 
of large capitalization stocks would exhibit shorter memory than middle capitalization stocks. 
This remark, that will be confirmed later, is in line with the observation reported by Di Matteo, 
Aste, and Dacorogna (2005) and according to which the more efficient a stock (or a market), 
the faster the decay of the correlation function of its volatility. 

1.2 Normality of the log- volatility 

We now turn to the distributional properties of the realized volatility. As suggested by many pre- 
vious studies (Andersen, BoUerslev, Diebold, and Ebens 2001a, Andersen, BoUerslev, Diebold, 
and Labys 2001b, Barndorff-Nielsen and Shephard 2002b, among others), the log-normal distri- 
butior@ provides an adequate description of the distribution of the volatility, at least in the bulk 
of the distribution. These observations clearly support the modeling of the realized volatility in 
terms of log-normal models, which goes back to Clark (1973) (see also Taylor (1986)). 

Starting from these observations, let us illustrate the relevance of the log-normal model for 
the realized volatility of the returns on the prices of the assets in our database. To this aim, let us 
denote by {a;i}t>i the logarithm of the series of the realized volatility. Figure[3]shows the kernel 
estimate (Pagan and UUah 1999) of the density of the log- volatility of a mid-cap (Microchip 
Technology) on the left panel and of a large cap (Coca Cola) on the right one, over the whole 
time period. The density of log- volatility of the price returns of Microchip Technology seems 
very close to a normal density at the naked eye. On the overall, it is the case for all the mid- 
caps. On the contrary, the densities of the large caps present sharp peaks and fat tails so that 
they significantly depart from the normal density. These visual impressions will be formalized 
later on, in section |4l 

•^Notice that Barndorff-Nielsen and Shephard (2002b) also show that the inverse Gaussian law provides an 
accurate fit of the distribution of the log-volatility. In fact, the log-normal and the inverse Gaussian distributions 
are indistinguishably close to each other over the entire range of interest. 
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[Insert figure\3\about here] 



2 Heterogeneity model 

As recalled in introduction, several explanations have been proposed, in the now large body of 
literature about the long memory of financial time series, to point out the various origins of this 
phenomenon. The first one, addressed by Granger and Joyeux (1980), concerns the nature of 
financial time series. Indeed, they consider that financial series results from the aggregation of 
micro-variables and that this aggregation is responsible for the long memory. The second one 
is the presence of infrequent structural breaks (also called structural changes) in financial time 
series (Diebold and Inoue 2001, Gourieroux and Jasiak 2001, Granger and Hyung 2004, Gadea, 
Sabate, and Serrano 2004, Davidson and Sibbersten 2005, among others). 

Our model is based upon Granger and Joyeux (1980)'s proposition; it explains the long 
memory of the log- volatility of asset prices by the aggregation of micro- variables intended to 
represent the heterogeneous expectations of each market participant. For, we suppose that the 
market aggregates the agents' anticipations of the log-volatility and that this aggregation drives 
the realized log-volatility. We will formalize this assumption later on. 



2.1 General framework 

The role of the log-volatility as the central object underpinning our model comes from the 
remark that, as recalled in the previous section, it is reasonable, in a first approximation, to con- 
sider that the realized volatility follows a log-normal distribution. As an additional hypothesis, 
we will assume that 

(HI) the log-volatility process {ujt\t& follows a Gaussian stochastic process, 

which obviously ensures that the volatility itself has a log-normal stationary distribution. 

Now, the standard economic theory tells us that any rational agent facing a decision problem 
aims at optimizing the output of her actions, based upon the entire set of information she has 
at her disposal. In the present context, it means that any rational investor strives for the best 
prediction of the future realized volatility dt+r, t > 1, based on the set {ai, erf} of her past 
observations. In the sense of the minimum mean squared error, the best predictor is given by 

at+r = ElcTt+rWl, ■ ■ ■ ,Crt] , (3) 

= E[e"'+-|o;i,...,a;i], (4) 

where cot+r denotes the best predictor of the log- volatility, based on the same set of observations 
and (Xi^ (r)^ is the (r-step ahead) prediction error, i.e. the mean squared error E [(cDt+r — t^t+r)^] ■ 
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Since the log-volatility is assumed to follow a Gaussian process, the best predictor a)t_|_^ is given 
by a linear combination of the past observations and the past anticipations. In particular (see 
Brockwell and Davis (1990, pp. 162-168)) 

t-i 

LOt+l = + E '^^'^ ■ ~ ' 

i=0 

where the sequence of coefficients {(pt,i}i>o and the long term mean u depend on the specific 
model each investor relies on and on the particular calibration method she uses to estimate the 
structural parameters of her model. Thus, one expects that the (pi's are specific to each market 
actor so that each investor performs different expectations about the level of the future realized 
volatility. In addition, each agent can incorporate some exogenous economic variables or some 
piece of (private) information she thinks to improve her prediction. 

Besides, considering that the market carries out an aggregation of the agents' anticipations, 
we postulate that 

(H2) the realized log-volatility is the average of all the individual anticipa- 
tions. 

Thus, denoting by Ui^t the agent i's forecast, the realized log-volatility is given by 

1 " 

ut{n) = -^uji^t, (V) 
1=1 

if the market in made of n participants. Under the assumption of an infinitely large number of 
investors, the realized log-volatility writes 

Ut = lim ujt{n). (8) 

n— >oo 

Our approach can appear utterly simplistic insofar as the real process yielding the realized 
volatility certainly involves non-linear transforms of the individual anticipations before they 
are actually aggregated through the price formation process, which is well-known to rely on 
various positive or negative feedback mechanisms (Shiller 2000, Somette 2003). However, in 
this first attempt to capture the impact of the heterogeneity of the investors' anticipations on the 
dynamics of the realized volatility, and in the absence of arguments allowing us to model these 
non-linear interactions, it is necessary to restrict ourselves to this assumption of linearity. 

Moreover, in order to be able to make tractable calculations, we need some other simplifying 
assumptions. Focusing on agent i, we denote by Xi^t its expectation about the future level of 
the excess log-volatility over its long term mean 

Xi,t = uji,t - ^ (9) 

on day t, and by 

1 " 

^n,t = c^tW-a; = -EAt (10) 
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the excess of the realized log-volatility over its long term mean. We assume that the anticipa- 
tion Xi t depends on the anticipation, Xj the agent made the day before and on the excess 
realized log- volatility Xn,t-i, but also on a public piece of information et as well as on a private 
piece of information r/j ^ according to the recursion equation 

Xi^t = '^i ■ Xi^t-i + i^i ■ Xn,t~i + CiSt + rii^t t>0, 2 = 1,2, ...,n. (11) 

From equation one should expect that tpi = —ipi if we consider rational agents who strive 
for the best prediction (in the minimum mean-squared sense) of the volatility. However, in order 
to generalize our model to the case where the market participants can be bounded rationality 
agents, we allows for ifji ^ —^i- Nonetheless, we will assume that these coefficients remains 
constant from time to time. 

Let us stress that the first order dynamics (fTTI) can seem too simple, and it probably is. How- 
ever, our approach amounts to postulate that the agents assume that the (log)-volatility follows 
some kind of ARCH process, which is quite reasonable. In addition, even if real agents use more 
sophisticated prediction schemes to perform their expectations about the future level of the real- 
ized (log)-volatility, and thus use higher order dynamics, we can still make the assumption that 
these higher order dynamics can be reduced to an aggregation of first order dynamics. There- 
fore, Xj i does not exactly characterize the expectation of individual agent but more precisely 
of a class of agents. 

In this basic setting, the ra-dimensional vector Xt = ^-^i,t) • • • ? X„ ^ j follows a first order 

(vectorial) auto-regressive process. The Gaussian noises {st}, {f]i,t}, {fl2,t}, • • • , are indepen- 
dent, centered, and of variance NdiX{et) = cr^ and Var(?7j t) = a^^, Vz, respectively. Moreover 
the structural coefficients ((^i,tpi) and q, which characterize the investors' behavior, can be 
considered as the result of independent draws from the same law (called the heterogeneity dis- 
tribution). These draws are independent of the values of the noises and the variables (y?, ip) and 
c are independent of each other. The parameter c that allows introducing heteroscedasticity must 
be strictly positive, while the distribution of (v?, ip) has to fulfill some hypotheses, which will 
be made explicit hereafter, in order to ensure the stationarity of the stochastic process {Xt}tez 
and of the aggregated excess realized volatility {Xn,t}tez- 

The appeal of the dynamic (fTT)) rests on the simple interpretation that can be made of the 
structural coefficient {^p, tp) and of its distributional properties. It is obvious that the impact 
of the previous anticipation Xi^t-i on the current anticipation Xj ^ depends on the value of the 
heterogeneity coefficient ipi'. 

1. when is close to (but less than) one, the impact of the previous anticipation on the 
current anticipation is very important and, unless a very large piece a information arrives 
- i.e. unless one observes a large st and/or a large r]i^t - the current value of the expected 
excess log- volatility Xj f will be very close to the previous one. On the overall, the agent 
believes in the continuity of the previous market conditions; 

2. when (fi is close to (but larger than) minus one, the impact of the previous anticipation 
still remains more important than the arrival of a new piece of information, but the agent 
exhibits a systematic tendency to believe in the reversal of the volatility since her antici- 
pation appears to be the opposite of the one she made the day before; 
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3. eventually, when \ipi\ is close to 0, the anticipations are only scarcely related to those 
made the day before and mainly rely on the flow of incoming news. 

To sum up, in the two first situations, we can notice that the market is highly self-referential: the 
impact of a new incoming piece of information remains weak, all the more so the closer to one 
the magnitude of ipi. On the contrary, when Iv^l is close to zero, the degree of reactivity of the 
market participants to a new incoming piece of information is high. Thus, the magnitude of ip 
can be seen as a way to quantify the degree of efficiency of the market. The same considerations 
obviously holds for ^j, but instead of referring to the past anticipation it refers to the publicly 
observed past realized volatility. 



2.2 Properties of the (log-) volatility process 

Equation (fTTI) can be written in a more compact form as follows 

Xt = AXt-i+efC + Tit, (12) 

where A = D + -1^ is an n x n matrix with D = diag(v9i, . . . , </?„), \1/ = (V"!, ■ ■ ■ , V'n)' 
and In = jl, . .J , 1)' , while C = (ci, . . . , c„)' and r]t = {r]i^t, • • • , Vn,t)- Besides, the range of 

n times 

the distribution of (y?, -ip) is assumed to be such that the spectral radius of A is less the one. This 
condition is necessary and sufficient to ensure the stationarity of the vectorial process {Xt}t>o- 

Under the assumption < 1, that will be assumed in all the sequel of this article, the 
stationary and causal solution of equation (fT2l) is given by 



fc=0 k=0 

SO that, as proved in appendix |Al 

Proposition 1. In the limit of a large number of economic agents, n ^ oo, the excess realized 
log-volatility {Xt}tez follows an infinite order auto-regressive moving average process 

' oo \ / oo \ 

1 _ ^ E [i,^^] L'+' X, = E [c] ■ 5^ E [^'] L' e„ (14) 

k=Q J \k=0 / 

where L denotes the lag operator. 

Not surprisingly, the flows of private information {rji^t] does not come into play since, under 
our assumptions, it does not convey any piece of information, on average across agents. 



As a byproduct of the proposition above, we see that the solution to equation (|T4l) can conve- 
niently be expressed in terms of an infinite order moving-average provided that ^^pE [^/'v^'^] -2^"*"^ 7^ 
1 for all z inside the unit circle: 



X, = E[c]-^fc_fc, (15) 



fc=0 
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where the /J^'s are formally given by the coefficients of the power series 

oo 

J2^kx' : = 



k=0 



1 — X ■ (p 
1 

1 — xip 



(^1-xE 


\ 1 




1 — X ■ (f 



-1 



1 — xip 



(16) 



(17) 



where g{Lp) = E 

The process Yt := {J2T=o^ [y^^] stationary if and only if Xlfclo^ \}P^'T ^ 

which requires that the law of Lp be concentrated on (—1, 1). It is however not sufficient, as 
shown by Gon9alves and Gourieroux (1988). Furthermore, Yt exhibits long memory provided 
that |E [y)^] I = oo. Restricting our attention to the case where Yt is said to exhibit long 

memory if its spectral density 



/y(A) 



27r 



-ikX 



k=0 



Y e 



1 1 
2' 2 



(18) 



we conclude that the density of 9? - if it exists - should behave as (1 — (y?)"'^^ as ip goes to one. 
Actually, Yt exhibits long-memory if dy E (0, 1/2) and anti-persistence if dy G (—1/2, 0). The 
upper bound on dy is necessary in order for the process to be second order stationary. 

As for the filter A{L) := 1 - E [iP^d''] L^+^ in the left-hand side of dM]), it is well de- 
fined if Xlfclo E [V^'/'^] ^ < 00 which, by Cauchy-Schwartz inequality, is satisfied if E [-f/^^] < 00 
and X^fclo E [v^^'^] < This requirement is, however, not necessary. As a further hypothesis, 
we will assume that Xlfclo^ \%pip^'^ x^^^ ^ 1 for all x G [—1, 1] which ensures that A{z) does 
not vanish for any z inside the unit circle. The behavior of A{z), as z goes to one, depends 
on the conditional expectation of if) given ip, namely g{p>) := E Either A(z) remains 

finite as z ^ 1, and Xt exhibits long memory if and only if Yt itself exhibits long memory or 
A(z) diverges hyperbolically, as z 1, and the memory (short or long) of the log-volatility 
process results from a mix between the properties of the distribution of (p and of the conditional 
expectation g{p>). More precisely, we can state that (see the proof in Appendix |B]) 

Proposition 2. Under the assumptions above, if the density of ip is f{ip) ~ (1 — ip)"", a G 

(0, 1/2), and if the conditional expectation ofip given (p is g{(p) ~ (1 — (pY, P > a — 1/2, as 
(p ^ 1^, the spectral density of the log-volatility process behaves as /x(A) ~ ^-2mm{a,/3}^ 
A ^0. 

As an illustration of this result, let us assume that (p Beta(— a, 1 + a). We deduce that 



l — Xif 



l—xip 



(1 — x)". If, in addition, we assume that E[?/'|(y9] = (1 — Lp)^, we get E 

r(i+a)r(i+/3) -^(-*-' ~'^'-> 1 + /^i where F is the hypergeometric function (see Abramowitz and 
Stegun (1965) for the definition). Thus, equation (fT6l) writes 



'1 -x)" 



k=0 



1 - 



T{l+a + f3) 
r(l + a)r(l + /3) 



(19) 



xF{l, -a; l+(3;x) 
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Under the assumption that P > 1, which assures that the denominator in equation (1191) 
remains bounded for all |x| < 1, the series YlT=o l^^x^ behaves like C(l — x)" in the neighbor- 
hood of 1 and thus diverges hyperbolically if a < 0. The expression of the spectral density is 
then deduced from equations (fT5l) and (fT9l) : 

I -| ^— *A 1 2a 

(20) 
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1 r(l+a+/3) g-iAFf1 1 + Q e-*^l 



r(i+a)r(i+/3)" 

and, since ~ C"|Ap°, the auto-correlation function reads p{h) ~ as \h\ 

A — >^0 

oo, provided that the second order stationarity condition a > —1/2 holds. 



For the sake of completeness, let us discuss the role played by the two parameters a and P 
in the model. As (p is assumed to follow a Beta(— a, 1 + a) law, then E[(p] = —a G [—0.5, 0]. 
It implies that the anticipation of an agent is not mainly based on her past anticipation. Re- 
call that \ip\ should be close to one in order to observe a strongly self-referential behavior. In 
other words, the agents attach more importance to the realization of the past volatility and to the 
information flows, on average. The closer a to 0, the less importance is given to the past antici- 
pation. Consequently, as the anticipations of the agents are more based on common facts (past 
realized volatility and informations), then the heterogeneity of the agents' beliefs decreases. As 
a conclusion the long memory should decrease. This phenomenon is shown on the left panel of 
figures vvhere auto-correlation functions for a fixed /? = 1.5 and different values of a ranging 
in [—0.45, —0.05] are drawn. 



[Insert figure^about here] 



The right panel of figure |4] shows the densities of ip for the same set of values of a. This 
figure confirms that the closer a to -0.5, the stronger is the divergence near 1. It means that the 
relative proportion of agents attaching importance to their past anticipation increases and thus 
the long memory increases. Furthermore, we notice that the speed of the decrease of the auto- 
correlation function and the divergence of the density function are strongly related. It means 
that a impacts both on short and long memory, which is not a surprise since - with our simple 
parameterization - a controls both the behavior of the density of ip close the one and in the 
neighborhood of zero. 

As for (3, in order to illustrate its role, we focus on the other heterogeneity parameter %}) 
whose mean is E[V'] = Y{i+lycii+0) ■ limits of E[ijj] when (3 tends to 1 and to infinity are 
respectively E[ip] — > 1 + a and E[V^] — > 0. So, the larger /?, the closer to on average. 

Now, by aggregation of the system (fTTj) . we obtain that the realization of the volatility on 
day t is equal to a quantity depending on the past anticipations plus the past realization of the 
volatility times the mean of -i/'i plus the information flows. Thus the aggregation of equation (fTTI) 
could be interpreted as an an AR(1) process where E[^] would be the auto-regressive coefficient. 
Even if the quantity depending on the past anticipations impacts on the short memory as we 
previously pointed out, all things being equal furthermore, when E[ijj] decreases there are less 
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short-term correlations and then the short memory falls. It is the reason why when E[^] gets 
close to (that is equivalent to (3 tends to infinity), the short memory drops (see figure [5]). 

[Insert figure \5\ about here] 



2.3 Examples 

Let us now discuss in details several typical examples of dynamics encompassed by our model 
and relate them with the agents' behaviors. 

2.3.1 Case 1: Rational agents 

As recalled in the previous section, rational agents base their anticipations at time t on the 
innovations Xj — X„ s < t — \. Thus, setting = — y^j, for alH = 1, . . . , n allows us to 
model the behavior of the realized volatility when the investors are fully rational agents since 
it allows us to retrieve the expression of the optimal predictor Q, in the particular case where 
the agents focus on the last innovation Xi,t-i — Xn^t-i only. In this case, the dynamics on the 
individual anticipation writes 

Xi^t = '^i- (Xi^t~i- Xn^t-ij + Ci- et + r]i^t ^ > 0, i = 1, 2, . . . , ra, (21) 

oo 

and by equation (fT6l) . we immediately obtain j3kX^ = 1, so that 

Xi = E[c]£i. (22) 

Notice, in passing, that we do not need each agent to be rational by setting z/^j = —f^i, but 
only that E[^|(p] = —ip which means that the agents are not necessarily individually rational 
but only on average. Thus, if the agents are rational (individually or in average), the excess of 
the realized log-volatility only reflects the aggregated information, publicly available at time 
t, namely E [c] et- This result is not surprising insofar as rational investors perform optimal 
anticipations and, therefore, the realized volatility can only convey the piece of information not 
present in the past innovation Xi^t-i — Xn^t-i- Then, since the information flow is assumed to 
be a white noise, the realized volatility should also be a white noise in the case where all the 
investors were rational. 

2.3.2 Case 2 : Absence of heterogeneity 

As a second basic example, let us assume that the investors only focus on the past realized 
volatility and completely neglect their past anticipations, i.e., ^9 = 0. Consequently, the dynam- 
ics of the individual anticipations writes 

Xi^t = i^i ■ Xn,t-i + CiEt + r]i^t t>0, i = l,2,...,n. (23) 



12 



By summation over all the agents i, and in the limit n ^ oo, one gets 

Xt=E[V']-X„,i_i+E[c]£t t>0, (24) 
which is a simple AR(1) process exhibiting only short memory since 

p{h)^B[^|Jf^. (25) 

Thus, as noticed in the previous section, we remark again that the heterogeneity coefficient (p 
is responsible for the long memory while %Ij mainly impacts the short memory. It clearly shows 
that the long memory phenomenon is rooted in the self-referential behavior of the investors, and 
more precisely in the self -reference to theirs own past anticipations and not to the past realized 
volatility. 

2.3.3 Case 3: Absence of reference to the past realized volatility 

At the opposite of the previous case, let us consider that each agent only bases her present 
anticipation on her past anticipation and on the flow of incoming news, but that she neglects the 
past realized volatility so that the dynamics of the individual anticipations reads 

Xi^t = ■ ^i,t-i + CiEt + r]^ t>0, i = l,2,...,n, (26) 

and the expression of the Pk's then simplifies to 

Pk = E [v^^] , (27) 

which allows us to write the excess realized log-volatility and its auto-correlation function as 

oo 

Xt^E[c]J2^[ip>']et-k, and p{h) ^ ^ , (28) 

A;=0 

oo 

provided that the stationarity condition ^(E[<^'''])^ < oo holds. 

fe=0 

It turns out that the properties of this dynamic has been investigated by Gon9alves and 
Gourieroux (1988). Based on their result, one can remark, in passing, that this the correlation 
function, namely the auto-correlation function of the aggregated time series, is very different 
from the average correlation function p. Indeed, before the aggregation, the auto-correlation 
function is of each individual agent's expectation is 

p,(/i)=^f, i = l,2,..., h^O,l,... (29) 

so that 

p(/i)=E[vpl'^l] /i = 0,l,... (30) 
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Besides, the dynamics exhibits long memory if (and only if) the density of diverges at one 
in order to get a hyperbolic decay of the correlation function (1281) . In other words, the density 
should behave like {\ — x)^'^,d < 1/2, in the neighborhood of one, that is to say like the density 
of the Beta law for instance, in order to obtain p{h) ~ A ■ \h\'^'^~^, as \h\ goes to infinity. In such 
a situation, one can conclude that a significant part of the agents base their anticipations on the 
one they have made the day before. 



2.3.4 Case 4: Deviation from rationality 

Let us now focus on a particular case of bounded rationality agents. For, we assumel^ if) = 
—a ■ + Lf', where a and ^p' are two constants while if follows a "stretched" Beta(p, q) law, 
namely a Beta(p, q) law extended over the entire range [—1, 1]. As we expect that the fraction 
of investors characterized by a heterogeneity parameter Lpi close to one be larger and, in fact, 
diverges at 1, the parameter p must be larger than one: p > 1. On the contrary, we expect that 
the fraction of agents characterized by a heterogeneity parameter close to minus one remains 
finite, so that q must be smaller than one: < g < 1. The case of rational agents is obviously 
encompassed by this representation, since it corresponds to the situation where a = —1 and 
<f' = 0. 

In this setting, the dynamics of the anticipation of the agent i becomes 

Xi^t = ■ ^i,t-i + i'f' - ctipi) ■ Xn,t-i + Ci ■ Et + T]i^t t>0, i = l,2,...,n. (31) 

and, from equation (fT6l) . the excess log-volatility Xt follows the infinite order moving-average 
(fTSl) with coefficients (3^8 given by the relation 



^ 1 — a + [a — xip')[l — x)i ^G{p,q,x) 

where 

X j{l + x)-'i ■ F{p + q-l,q;p + q,j^) if x > 0, 
G[p,q,x) = < ^ ^ ^ o . (33) 

^ [l-x)-^-F{l,q;p + q,j^) ifx<0, 

withF(., .;.,.) is the hypergeometric function. Two conditions are required in order for ^(p, q, x) 
to remain finite for all |x| < 1: p > 1 and < g < 1. Since < g < 1, the numerator diverges. 
Let us study the behavior of the denominator, and figure out the conditions for its convergence. 

f^^^^ {l-a){l-xy-i + {a-x<f')G{p,q,x) x^i a-^'' 
which yields the following condition for long memory 

ifi' = a. (35) 
The dynamics of the anticipation of the agent i then becomes 

Xi^t = Vi ■ ^i,t^i + <^i^ - Vi) ■ ^n,t-i + Ci ■ St + r]i,t t>0, i = l,2,...,n. (36) 



^Remark that most of the results in the section still hold if we only assume E [i/ijiys] = —a ■ (p + (p'. 
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When ipi is close to 1 then a(l — ipi) is close to and inversely when ipi is close to then 
a{l — ifi) is close to a. Finally, when is close to -1, then a(l — is close to 2a. There 
is a kind of balance for an agent to the weight she gives to her past anticipation and to the past 
realized volatility. The more importance she gives to her past anticipation, the less the past 
realization matters. And inversely. Besides, when an agent decides to base her anticipation 
contradictory to her past anticipation (corresponding to ^9 = —1), for example she thinks she 
was wrong, then she also gives and important weight to the past realization via the coefficient 
2a. This is a reasonable behavior. 



(37) 



So, under the assumption that the denominator in equation (1371) remain bounded for all \x\ < 1, 
which requires that 

1 — n 

<a<l, (38) 

q 

the series YlT=of^kx'' behaves like C(l — x)^^^ in the neighborhood of 1 and thus diverges 
hyperbolically if g < 1. In such a case, the auto-correlation function behaves like K\h\^~'^'^ as 
— s> oo, provided that the second order stationarity condition q > 1/2 holds. 

Now, we will study the influence of the parameters on the shape of the auto-correlation 
function. First we focus on the short memory case where cp' ^ a, then on the long memory case 
where (p' = a. 



Replacing Lp' by a in equation (1321) leads to the simplified expression 



Short memory In the case ^ a, the condition for the denominator not to vanish becomes 

a > max (^ip', (^+^^^'+^ ^ if ^ > 1 and < a < (^+^^^'+^ if _9_ < i_ We setp = 5, 

q = 0.75 and a = 0.8. Figure [6] shows how the auto-correlation function behaves when (p' 
tends to a. We observe that the nearest cp' is to a the slower the decrease of the auto-correlation 
function. 



[Insert figure^about here] 



As a special case, let us set a = 1, as in the case of rational agent, but ip' ^ 0. It is, in some 
sense, the simplest way to account for the departure from rationality. The anticipation of the 
agent i then writes 

Xi^t = 'Pi ■ (j<^i,t-l - Xn,t-l^ + <p' ■ Xn,t-l + Ci-£t + Tji^t t > 0, i = 1,2, ... ,71, (39) 

which means that the agent performs her anticipation on the basis of the past innovation Xi^t-i — 
Xn,t-i but, in addition, also pays attention to the past realized volatility in itself, as pointed out 
by the presence of the term (p' ■ Xn,t-i- 
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Equation (1161) yields 

oo ^ 

■ (40) 



1 — X ■ ip' 

which shows that the dynamics of the (excess) realized log-volatility is nothing but a simple 
short memory AR(1) process 

Xt-^'- Xt-i = E[c]et, (41) 
whose auto-correlation function is given by 

pih) = ^'I'^l (42) 

Thus, if if' = 0, the volatility exhibits exactly the same behavior as in the case of fully rational 
agents (case 1), which shows again that the rationality at the individual level is not a necessary 
condition for the volatility, at the aggregated level, to behave as if the agent were individually 
rational. 



Long memory Let us now focus on the case where the model exhibits long memory, namely 
when if' = a. We are interested in the role of the parameters p, q and a in the short and 
long memories. On figure |71 the role of p is depicted. The two other parameters q and a are 
fixed respectively to 0.75 and 0.3 and p ranges between 1.05 and 8. On the left panel the auto- 
correlation functions are drawn over 200 lags, and on the right one the corresponding density 
functions. It shows that the larger p, the slower the decrease of the auto-correlation function at 
short lags. Thus, p only influences the short term behavior of the auto-correlation function. 



[Insert figure^about here] 



On figure [HI the role of q is studied. The two other parameters p and a are fixed respectively 
to 5 and 0.3 while q ranges between 0.55 and 0.85. The closer q is to 0.5, i.e. to the lower 
bound for second order stationarity, the faster is the divergence of the density function in the 
neighborhood of 1 and the slower is the decay of the auto-correlation function. Consequently, 
q impacts on the long memory. 



[Insert figure\8\about here] 



Figure [9] shows the role of a. The two other parameters p and q are fixed respectively to 5 
and 0.75 and a ranges betwwen 0.1 and 0.9. We observe that a has an important impact on the 
short memory. Indeed, the smaller a, the slower the short-lag decrease. 



[Insert figureMabout here] 
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Let us take now into account a larger class of agents' behavior in the case of = a. To this 
aim, we modify the density function: we still keep the beta law but we add another law which 
have a bell-like density. The bell will be able to move all over [-1,1]. For example, if the bell is 
set around zero, it means that the agents do not give importance to their past anticipation. 

(43) 

where K is the normalizing constant (that can be expressed in closed-form) law and w in [0, 1] 
is the weight given to the stretched Beta law.The conditions of long memory remain the same, 
only the conditions of stationarity change. They can be easily expressed in closed-form but their 
expression is rather cumbersome; That is why we will not give them here. 

On figure [lOl the role of the relative weight of the singular part the density, w, is studied. 
The parameters p and q are fixed respectively to 5 and 0.75, a = 0.3, m = 0, cr = 0.2 and w is 
ranged over [1/4, 3/4]. The weight given to each density impacts on the short but not the long 
memory. The larger the relative part of the bell-like density compared to the stretched Beta one, 
the faster the decay of the short memory. In terms of agents' behavior, it means that the more 
the agents base their anticipations on the incoming information flows and on the past realization 
of the volatility, the smallest the short-term correlations are. 

[Insert figureUOlabout here ] 



Figure [TH shows the impact of the width of the peak of the bell controlled by a. The 
parameters p and q are fixed respectively to 5 and 0.75, a = 0.3, m = 0, w = 1/2 and a ranges 
between 0.1 and 0.3. The intensity of the peak of the bell-like density has only a little impact 
on the short memory. The sharper the bell is, the more important is the short memory. 

f Insert figure 1771 about here] 



On figure [l2l the role of m which determines the position of the bell is studied. The param- 
eters p and q are fixed respectively to 5 and 0.75, a = 0.3, a = 0.2, w = 1/3 and m is ranged 
over [—0.8, 0.8]. The translation of the bell plays a part on short memory. Over the very first 
lags, we notice that the closer the bell is to -1, the faster the decrease of the auto-correlation 
function is. But after a few lags, this situation does not hold anymore. 

[ Insert figure [72] about here ] 
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3 Statistical Inference 



Let us now turn to the question of the estimation of the parameters of the model. As exempli- 
fied in the previous section, the long memory phenomenon can be ascribed to the hyperbolic 
divergence of the density of the heterogeneity parameter Lp in the neighborhood of 1~. It is thus 
convenient to split the density of the heterogeneity variable ip in two terms: a regular one and 
a singular one. For this reason, we will model the law of the realized log-volatility of financial 
assets as the mixture of a regular law, with density /i, and of a Beta(2, 1 — d) law {d e]0; 1[) 
with density /2, that will allow capturing the hyperbolic divergence of the density of at one. 
As a consequence, the density of Lp can be written as 

f{x) = wh{x) + (1 - w)h{x), w G [0, 1], (44) 

where /i is any regular, i.e. continuous and bounded, density function defined on [—1, 1] and 

Mx) = (1 - d)(2 - d)-^--l[o.,iiix). (45) 



Given a density /i(x) which admits an expansion in terms of a Fourier series, an assumption 
that will be made in all the sequel of this article, 

^ oo oo 

an cos(n7rx) + ^ &n sin(n7rx), x e [-1, 1], (46) 

n=l 71=1 

the expression of the auto-covariance and auto-correlation functions of the realized log- volatility 
can be numerically calculated by use of the expression of the moment of order k of the hetero- 
geneity variable (p 

E [ip'] = wE^ l^"] + (1 - w)E, [^'] , (47) 

where Ei[-] and E2[-] denotes the expectations with respect to fi and /2 respectively, which 
yields 

(1 °° oo \ Y(k + 2) 

-Ak + J2(^nBn,k + J2bnCn,k j + {I - w) ■ - d) , (48) 

71=1 n=l / ^ ' 

where the expressions of Ak, and Ck are given in appendix iDl 



3.1 Asymptotic Normality of the estimator 

For simplicity, and for ease of the exposition, we will assume that the coefficients (a„, 6„) of 
the Fourier expansion (|46l) vanishes beyond the rank q, so that the density /i reads 

/i(^) ~ 2 cos(rj-7rx) + sin(n7ra:), 1,1]. (49) 

n=l 71=1 

In addition, we focus on the case where E [^Iv?] = —a{'p — 1), which ensures that the long 
range memory of the volatility is controlled by the parameter d of the Beta law, as shown in the 
previous section. 
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Let us denote by 6* = {ai, . . . ,ag,bi, . . . ,bg,a, w, a^, d)' the 2g + 4 dimensional vector of 
the parameters involved in our problem and by 7(/i; 6) the value of the auto-covariance function 
at lag h for the distribution of heterogeneity defined by equations (|44)) . (|45l) and (|49|) and for the 
parameter value 6. 

Given the T-sample of the logarithm of the realized volatilities {cji, . . . , ujt}, treated as ob- 
served data, it seems natural to estimate by minimization of the weighted difference between 
the sample auto-covariance function 

^T{h) = —^{uJi-U)T){'^i+h-^T), /l = 0, 1,... (50) 

i=l 

and 7(/i; 0), the auto-covariance function at the parameter value 0. We could thus consider the 
classical minimum distance estimator Ot^l (Newey and McFadden 1994) solution to 

min [7r,L - 7l(^^)]' W^^ [7t,l - 7l(^)] , (51) 

where 7^(6!) = (7(1; 9),..., 7(L; 6))', ^t,l = (7t(1), • • • , 7t(^))' and Wl is any symmetric 
positive definite L x L matrix, while 6 is the parameter set 

6 = {^^ = (ai, . . . , Og, 61, . . . , bq, a, w, a, d)'; /i > 0, < d < 1 and < w < 1} . (52) 



However, when dealing with long-memory time series, i.e such that J2'h=o l7(^)l = '^he 
asymptotic properties of the sample estimates 'Jt,l are not really suitable. Indeed, as recalled 
by Hosking (1996), the limiting distribution of 'jrih) when c? > is such that 

^l-2^(7H/^)-7W)^7^, (53) 

where TZ denotes the modified Rosenblatt distribution. In particular, the mean of the Rosenblatt 
distribution is not equal to zero and can even be much larger than its standard deviation for d 
larger than or of the order of one fourth. 

As a consequence, it is desirable to rely on the minimization of another criteria with more 
suitable asymptotic properties. In fact, irrespective of the value of d E (— i, i), the limit 
distribution of any subset of the variables Dh = VT [{'^rih) — 7t(0)) — {^{h) — 7(0))], h > 1 
is a multivariate normal with zero mean and asymptotic covariance matrix (see Hosking (1996, 
th. 5)) 

= hm Coy {Dk,Di), (54) 

L J HI T— >oo 
^ 00 

= 2 5Z [l{s)-l{s-k)-^{s-l)+^{s~k + l)]\ (55) 

s=— 00 

In view of this asymptotic result, a convenient estimator of the parameter 9 is given by the 
solution to 

min [fiT,L - W^' [^T,L - Viie)] , (56) 

where 

VLie) = (7(1; 9) - 7(0; 9),..., ^{L; 9) - 7(0; 9))' , (57) 
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and 



1T,L = (7t(1) - 7t(0), . . . , 7r(^) - 7t(0))' . 



(58) 



The consistency and the asymptotic normality of the estimator 9t,l follows from the general 
asymptotic properties of the classical minimum distance estimators. Concerning the asymptotic 
normality, we can state the following result 

p 

Proposition 3. Assuming that 6t,l — ^ Oo, for all L > 2q + A, as T — > oo, 

Vt (eT,L - 0o) ^AfiO, AL{eo)) , (59) 

with 

Al{9o) = [GLi9oyWLGL{9o)]-' GL(^^o)'W^LSi(^o)W^LGL(^^o) [Gl(^o)'W^lGl(^^o)]"' , (60) 
where 

Li9o) = gTadgriL{e)\g^g^ , (61) 



G 



and 



-. OO 

- J2 b{s;9o)-i{s-k;9o)-i{s-l;9o)+i{s-k + l;9o)]\ (62) 



/, /l — 1, . . . , L. 



Proof. By theorem 5 in Hosking (1996), VT {fiT,L - Vl{9o)) ^ A/" (O, S^(0o)), thus, the 
result follows straightforwardly from theorem 3.2 in Newey and McFadden (1994). □ 



For fixed L, the minimum of the asymptotic variance Al{9o) is reached when Wl 



El{9o)] \ so that 



T (9t,l -9o) (O, GL{9oy [T^'M] G^] 

On the other hand, given Wl, one can get an optimal value L* of L as the solution to 



w cos(7rx) 

w cos(g7rx) 
w sin(7ra;) 

w sin(g7ra;) 


flix)-f2ix) 





_ {w-l)f2{x) ln(l-x) + 



3-2d 



-d){2-d) 



(63) 



L* = arg min \\Al{9^)\Y. (64) 

LG{2(7+4,...,T-1} 

Proposition 4. Under the assumptions in proposition\3\ denoting by D{9) the vector 



(65) 



20 



the estimator f{x) of the density f{x) is asymptotically Gaussian 

Vf (/(x) - fix)) ^ M (0, D{9,yA{e,)D{eo)) . m 

Proof. D(6) is nothing but the gradient of f(x) with respect to 9. Thus, by use of the Delta 
method (van der Vaart 2000), the result follows straightforwardly from proposition [3l □ 

4 Empirical results 

In this section we present the conclusions drawn from the calibration of our model whose im- 
plementation is discussed in appendix El To this aim, we use the intraday prices of ten middle 
and fourteen large capitalization stocks traded on the NYSE or the Nasdaq from 01/01/1994 
to 12/31/2003 (which represents 2518 trading days). The description of the data, provided by 
TickData, is given in table [U 

[Insert table\^about here] 



We first estimate the daily realized-volatility by use of the estimator ©, as already discussed 
in section [T] We stress that this variable will be considered as an observed variable in all the 
sequel. Before going further, it is worth to notice that we should get 390 one-minute prices 
for all assets since the quotations begin at 9:30 am and end at 16:00 pm. However, we have 
much less observations for the middle capitalization stocks than expected: typically 133 per 
day, on average. In addition, the number of intraday quotations is not constant over time. 
Indeed, the number of available data is much smaller from 1994 to 1997 than from 1998 to 
2003 (respectively 45 and 192 for the middle capitalizations and 282 and 374 for the large 
capitalizations). Nevertheless, we has chosen to estimate the realized volatility over the whole 
time period ranging from 01/01/1994 to 12/31/2003. 

4.1 Calibration of the model over the whole time interval 

Let us underline that the estimation of the density of the heterogeneity variable provides us 
access to the parameter d that characterizes the long-memory behavior of the time series of the 
realized log-volatility. It is then interesting to compare the values of d obtained with our model 
and those obtained by other methods like the rescaled rang method set by Hurst (1951) and the 
regression method introduced Geweke and Porter-Hudak (1983). Notice that the former method 
has been refined by Mandelbrot (1972, 1975), Mandelbrot and Taqqu (1979) and later by Lo 
(1991). However, his generalization involves an additional parameter whose value has a great 
influence on the results (Teverovsky, Taqqu, and Willinger 1999). As a consequence, we have 
only resorted to the classical rescaled range statistic. 

First we show the graphical results obtained for a middle capitalization: Fidelity National 
Financial inc. (FNF, see figure [TS]). On the right panel the estimated auto-correlation function 
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fits very well the sample one. The density of the heterogeneity variable is depicted on the 
left panel (plain curve) with the pointwise 95% confidence interval (dashed curves). This shape 
with three distinct masses (one close to -1, an other close to 1 and the last one around 0) is 
representative of half of the assets (The other half is depicted afterwards). In terms of agents' 
behavior, it means that there is mainly three kinds of agents. First, looking at the central part 
of the density function, we can conclude that most of the agents base their anticipations on 
the incoming flow of information and on the past realization (depending on the value of d). 
Some others believe in the anticipation they performed the day before, which is related to the 
diverging part of the curve, in the neighborhood of one, while only a few, shown by the part of 
the density near -1, systematically take the opposite of their previous anticipations. 

[Insert figure\T3\about here] 



Secondly we show what we graphically obtain for the other half of the assets with the 
example of a large capitalization: Microsoft (MSFT, see figure [14]). Instead of noticing three 
distinct masses, we only see two. The agents confident in their past anticipation still remains, 
but it is now more vague with the second category. In fact the curve is symmetric with a peak 
centered around -0.5. We still find agents who do not trust anymore in the anticipation they 
performed the previous day (the part of the density function near -1), and others who only 
care about incoming news (the part of the density function near 0), and between these two 
situations there are many agents who both take into account the news and the opposite of their 
past anticipation. This is quite logical if an agent realizes that her past anticipation was far from 
the realization of the volatility, then she takes the opposite of her previous anticipation and also 
becomes more careful in the incoming news. 

[Insert figure\T4\about here ] 



These two typical shapes of the distribution of heterogeneity does not appear to be related 
to the size of the firms nor to any specific industry. However, the small number of assets per 
industry in our database (maximum five) prevents us from drawing definitive conclusions. 

The results of the estimations of the long memory parameter d by our model and by the two 
semi-parametric methods are shown in table |2l It is quite obvious that, on the one hand, each 
estimators agrees to say that long memory is present in almost all series while, on the other 
hand, each estimator gives, for a same asset, different values of d. Most of the values obtained 
by our model range between 0.2 and 0.4 and a few are negative, instead of between 0.3 and 0.45 
with Geweke and Porter-Hudak's estimator. At least the long memory parameter obtained by 
the classical rescaled range analysis method is mainly contained between 0.40 and 0.55. 

It is common knowledge that beyond d = 0.5 the series is no longer stationary. As the 
estimations by the Hurst method are greater than, but close to, 0.5, we may say that it is due 
to the uncertainty and, maybe, the inaccuracy of the method. Moreover, the differences noticed 
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between these results may be explained by the difficulty to use the estimators. Indeed, it is 
well-known that Geweke and Porter-Hudak's estimator is quite sensitive to the presence of 
short memory. On the contrary, our estimates should be considered as more robust vis-a-vis the 
presence of short-memory insofar as our model takes it into account explicitly. 

4.2 Study of the bubble burst effect 

The study of the bubble burst effect is motivated by the fact that we will be able to get infor- 
mations about the agents' behavior during a prosperity period (before the bubble burst) and 
during a recession period (after the bubble burst). In particular we will be able to answer the 
question:"Do the agents behave differently before and after a bubble burst ?" 

To perform the computations, we randomly selected 5 large capitalizations because the bub- 
ble phenomenon is far more pronounced in their price evolution than it is for middle capitaliza- 
tions. We simply split the series when the maximum price is reached into two subseries. The 
first one is defined as the pre-bubble burst period while the other one is the post-bubble burst 
period. Hereafter, the illustrations are drawn for a "new technology" company, Cisco Systems 
(CSCO), and a distribution company. Coca Cola (KO). The point is that the bubble phenomenon 
may be more pronounced in the price evolution of a new technology company, which was more 
affected by the internet bubble, than a non technology firm. 

4.2.1 Study of a new technology asset: Cisco Systems 

On the left panel of figure [15] we show the price evolution of CSCO between 01/01/1994 and 
12/3 1/2003. On the right panel of figure[T5]we have drawn the sample auto-correlation functions 
of CSCO over the three periods. From it we easily deduce that on the one hand before the bubble 
burst the long memory parameter may be much larger than the one over the whole period, and 
on the other hand after the bubble burst, it must be a lot smaller. 

[Insert figure\T5\about here] 



Let us complete these impressions with the densities and auto-correlation functions esti- 
mated over the two subperiods (see figures [T6] and [TTI) . We show on the left panel the estimated 
density with its 95% confidence interval and on the right one the sample and estimated auto- 
correlation functions. 

Concerning the pre-bubble burst period (see figure [T6l) . on the right side, we see that the 
estimated auto-correlation function fits pretty well to the sample one. Moreover on the left 
side of this figure, the 95% confident interval is very thin. We deduce that the optimization 
performed well. 

The shape of the density is quite similar as the one we obtained with Microsoft (see figure 
[T4l) and implies a strong long memory. To sum up, we get two masses: the one close to 1 
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represents agents who believe in their past anticipation; the other one, from -1 to 0.2 includes 
several different behaviors. Nevertheless they have something in common: they do not replicate 
their past anticipation. Some are rational and use the incoming information flows and others not 
really. They are inspired by the opposite of their past anticipation or a mix between the news 
and the contrary of the previous anticipation. 

[ Insert figure UM about here] 



As for the post-bubble burst period, we first notice on figure [17] that the estimated auto- 
correlation function fits very well to the sample one (on the right panel) and that the 95% 
confident interval of the density function is good too. 

Here, the situation is a little bit different: the value of the long memory parameter d is 
smaller (see table |3] hereafter). Basically, it means that the proportion of agents who believe 
in the continuity of the previous market conditions is smaller. Figure [18] helps us compare the 
agents behaviors during the two subperiods. We observe that after the bubble burst the mass 
from-1 and 0.2 changes. It was first compressed between -1 and 0, then the mass has got divided 
into two masses. As a consequence, on the one hand the proportion of agents who perform 
their anticipation basing their judgment on the contrary to their previous anticipation is greater. 
Those agents may recognize that they were wrong or it can reflect the confusion resulting from 
the increase in the level of uncertainty about the future evolution of the economic environment 
after the bubble burst. On the other hand, the proportion of agents who mainly use information 
is greater too, in accordance with the increase in the level of uncertainty and therefore with the 
increasing need for information. 

[Insert figureUTlabout here] 



[Insert figure\T8\about here] 



4.2.2 Study of a non technology asset : Coca Cola 

We reproduce for Coca Cola the same figures as for Cisco Systems. We notice on the left side 
of figure[T9]that the bubble phenomenon is less obvious as observed previously. The immediate 
consequence is shown on the left side of this figure where the sample auto-correlation functions 
over the three periods are not as different as they are for CSCO. Then we expect to get closer 
long memory parameters, especially over the whole period and after the bubble burst. 



24 



[ Insert figure [791 about here] 



Let us be more accurate with the densities and auto-correlation functions estimated over 
the two subperiods (see figures [20l and [2T]) . For both of the figures, notice that the estimated 
auto-correlation functions on the right panels fit very well to the sample ones (in particular after 
the burst), and the 95% confident intervals are good too. 

[ Insert figure |20| about here] 



f Insert figure \21\ about here] 



As the shapes of these two densities look very similar let us put them together on the same 
graphic in order to analyze them (see figure [22]) . Looking at the divergence at one, the long 
memory parameter may be more or less the same. Contrary to the evolution of CSCO density 
where the bulk of the post bubble burst density in the negative range was compressed, here it 
does not change that much. It only slightly expands along the positive axis after the bubble 
burst. The mass from 0.2 to 0.7 for the pre-bubble burst period which represent the agents who 
both believe in the continuity of the market and also take into account the information flows has 
disappeared. It means that a smaller fraction of agents still use their past anticipation. 

/ Insert figure |22] about here ] 



4.2.3 General observations 

Let us look at the values of d. The values of the long memory parameter obtained by our 
model (see table [S]) confirm our visual impressions. Indeed, before the burst, the long memory 
parameter is often greater than the one over the whole period for 4 out of 5 cases (in the 5th they 
are nearly similar). On the opposite, one observes that after the burst, d falls for all the assets 
except for Coca Cola. 

We have to mention that the very large standard deviation of the rf-estimate for Microsoft 
during the second period should raise doubts about the convergence of our algorithm in this 
case. A reason might be the small number of available data. Indeed, for the post bubble burst 
period we only have about 900 days for 4 out of 5 assets whereas we have about 1600 data 
available for the pre bubble burst period. In the case of a smaller data number, the global 
optimum is more difficult to reach. Consequently the result may be wrong. 
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Nonetheless, to sum up, the results we obtained lead us to conclude that during a growth 
cycle, the number of agents who are confident in the market and whose strategy remains almost 
the same day after day, is greater than the one during a decline cycle where the agents adopt 
more various behaviors. Indeed, some still base their anticipation on the previous one. Some do 
not believe in their past anticipation anymore and take into account the information flow. Some 
admit they were wrong the day before and make a contrary anticipation. Others make a mix 
between a contrary anticipation and taking into account the news. 



5 Conclusion 



Based upon the recent literature on the aggregation theory, we have provided a model of realized 
log- volatility that aims at relating the behavior of the economic agents to the long memory of the 
volatility of asset returns. In spite of its simplicity, this model allows taking into account many 
agents' behavior and performs good estimates in general. The estimated coefficients often lead 
to auto-covariance and auto-correlation functions well fitted with their sample counterparts. In 
addition, the results derived from the study of the bubble burst effect - namely a higher tendency 
to replicate the anticipations of the day before and to neglect the incoming information flow 
before the bubble burst than after - are quite reasonable. 
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A Proof of proposition [T] 

Under the assumption \ \A\ \ < 1, the stationary solution of equation (fT2)) is given by 

oo oo 

Xt = Yl A'Cet-k + Yl ^'Vt-k, (67) 

fc=0 k=0 

and the excess realized log- volatility writes 



k=0 ^ ' fc=0 

Accounting for the fact A = D ^ ^ ■ 1^, is solution of 



(68) 



= A^~^ ( D + ■ iM , (69) 



n 



so that one has to solve the recurrence equation 

I'^A^ = {I'^A^-') D + (70) 
where I3k = j^lnA'^tp. It is then a matter of simple algebraic manipulations to show that 

k 



-I'^A^C = h[^D'C+-y2P^-l■K.D'-'C, (71) 
n n n ^-^ 

i=l 

k 

E [c] ■ E [p''] + E [c] ^ A-iE [p'"'"] (a.s), as n ^ cx), (72) 



4 = 1 



where 



(3k 



-I'^D'^ + V A^i ■ (-I'^D^-'A , (73) 
i=i ^ ^ 

k 

E [ip^^] + ^ A_i ■ E [ip^^-'] (a.s), as n oo, (74) 



i=l 



while 



^In^N*. = ]^l'^D%^ + ^Y.^^-^-^'nD'-'Vt,. (75) 

i=l 
k 

= E[^']E[7]t] + Y,f^i-i^[v'~']'^[Vt] (a.s), asn^cx), (76) 
= 0, (77) 
provided that E[T]t] = 0. 
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So, Xt = lim„_oo Xn,t is equal to 

00/ k \ 

= E [c] ■ ^ E [yp^] + J2 A-iE [f'-'] Bt-k. (78) 

fc=0 \ i=l J 

Let us simplify this expression by setting 

k 

4 = E[<^'=]+^A_iE[<^'=-], (79) 

i=\ 

SO that 

00 

Xt = E[c]-^te_fc. (80) 

fc=0 

Now, multiplying equation (l79l) by and summing over from zero to infinity, we get 

00 00 00 / k \ 

Y.~^k^^^'^' = J^E [/+^] x'^' + E E/^^E [/"T x'^^^ (81) 

fc=0 fc=0 A:=0 \j=0 / 

As it is well-known that 

00/fc \ / 00 \ / 00 \ 

E E [^^-] x'^' =xIY: Pk^ix' E E [^1 > (82) 

fc=0 \j=0 / \fc=0 / \fc=0 / 

we obtain 

E/^fc^' = Ee [^'] x^ll + xj^f^kx'}- (83) 

k=Q k=0 I fc=0 J 

00 

Then, focusing on ^ Pkx'^ and by use of equation (1741) . the same argument yields 

k=0 

00 00 00/fc \ 

J] = J]E t^^'^+^j x'^^^ + E E AE [V^] (84) 

fc=0 k=0 k=0 \i=0 J 

00 00 /oo\/oo \ 

^ Ax'^ = 5^E [tA/] x' + xl Y^P.x' Ee [^^'] ■ (85) 

fc=0 A;=0 \i=0 / \i=0 / 

In other words 

00 

EeM^' 

E Z^'^^' = ' (86) 

^=0 l-x^E[V<^'=]x^ 

k=0 
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which, by replacement in equation (|83l) . leads to 

oo 



X 



k=0 



k=0 



(87) 



X 



k=0 



E 


1 




1 — X ■ ip 


1 — xE 


\ 1 


1 — X ■ ip 



and, provided that the permutation of the expectation and summation signs is allowed 

oo 
k=0 

which concludes the proof. 

B Proof of proposition |2| 

The expression of the spectral density of Xt follows from proposition [Hand reads 

oo 



(88) 



□ 



-ikX 



k=0 



ikX 



k=0 



(89) 



Provided that the permutation of the expectation and summation signs is allowed, we can rewrite 
this relation as 



Now, focusing on the term 





E 


1 




1 - e-'^ ■ ip 
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1 - e-'^ ■ E 




1 - e-*^ ■ ip 



(90) 



N{x) = E 



1 



(91) 



1 — X ■ ip 

it follows from Karamata's theorem (Bingham, Goldie, and Teugels 1989) that N{x) ~ (1 — 

x)~°', as a; 1, provided that the density of ip satisfies f{ip) ~ (1 — ip)^°', a > 0. 

Similarly, provided that the density of ip satisfies the previous assumption and that the con- 
ditional expectation g{ip) ~ (1 — ip)^, P > a — 1/2, the term 



D(x) = E 



1 — X ■ ip 



(1-x) 



a> (3 
a < (3 



(92) 



as X — s> 1. 
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So, when a is positive and larger than [3, the ratio N{x)/ D{x) ~ (1 — x) ^ so that the 
spectral density ~ X^'^^. On the contrary, when /? is greater than of equal to a, the ratio 

N{x) /D{x) ~ (1 - x)"" and the spectral density /x(A) ~ A"^". □ 



C Calculation of the auto-correlation function by the use of 
the Fast Fourier Transform 



The Auto-covariance function 'jx{h) of a time series {Xf} can be obtained by the spectral 
density of {Xt} according to 

jx{h) = r e'^^fx{X)d\ = r e'^^fx{X)d\. (93) 

J-TT Jo 



Equation (|93] ) in terms of a sum of integrals 

^1 /.^(fc+i) 



— h^h 



(94) 



fc=0 " Tv ' 

By the trapezes method one can express an integral over a short domain such as 

fc=0 1^' 



E/Jr'^'^ e^'Vx(A)ciA = ^ (e^tV^ (|a:) + e^tC^+D/^ + 1))) . (95) 



As, given the shape of the spectral density, we may face problems in and 2%, lets us rewrite 
equation (|94] ) as 



— /•2n 

lx{h) = I " e*"Vx(A)rfA+ / e*"Vx(A)ciA 



TV— 2 

+ ^ E (^^^''/^ (f ^) + e^-^'^^^^Vx + 1)) ) , (96) 



,2k 



fc=0 



2n 
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fc=i 



(97) 



where the spectral density is deduced from equations (1151) and ([T9 



/x(A) 



cr 



II - e 



-iX\2a 



271 



r(l+a)r(l+/3) -^v-^' ; 



2 ' 



(98) 
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with .; ., .) is the hypergeometric function, and is the variance of Et, and 



/27r 



2a+l 



1 



1 



r(a+/3) 



r(i+a)r(/3) 



(99) 



The last term of equation (1971 ) can be easily calculated by the use of the inverse Fast Fourier 
Transform. 



D Auto-correlation function 



El [(f''] is given by 



where 



^ CO oo 

Ei[(p^] = -Ak + ^a^Bn,k + ^KCn,, 



n=l 



n=l 



(100) 



Ak= ( k+1 



if k is even, 
if k is odd, 



(101) 



n,k 



2k k{k-l) 



k{k-r 



B 



n,k~2 



Bn^k-2 if k is even, 
if k is odd, 



with Bnfl = and Bn,i = 0, 



(102) 



a 



n,k 



k{k-l) 

— '^n,k-2 



if k is even. 



(-ir^— - iffcisodd. 



with C„,o = and Cn,i 

E2 [a''] is simply given by 



nvr 



E2b1 



r(fc + 2) 

r(A; + 3-(i) 



r(3-rf). 



(103) 



(104) 



E Implementation of the econometric procedure and conver- 
gence of estimators 

For finite size samples, the distance {j}t,l — ViiO))* W^^^ {vt,l — Vl{0)) can exhibit several 
local minima and, in practice, it actually does. Therefore, it turns out to be necessary to use a 
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minimization algorithm that is able to deal with such a problem, preventing from being trapped 
in a local minimum, and then to find the global minimum. Genetic algorithms provide relevant 
solutions for such situations and they have been retained to solve our problem. 

The idea underlying genetic algorithms is based on the mimicry of the natural selection 
process and genetic principles. The genetic algorithm starts with a population of trial vectors - 
called genes - containing the parameter 9 to optimize and unfolds as follows: 

• The first step consists in the replication of the initial trial vectors according to their fitness, 
that is the genes whose distance is the smallest have the highest probability to reproduce. 
Thus, on the average, the new population has a smaller distance than the initial one, but 
its diversification is also weaker since the fittest genes obviously appear twice or more in 
the new population. 

• The second step is the crossover which leads to combine the different parameters from 
several vectors drawn from the new population in order to mix their characteristics. 

• The third and last step is the mutation, where some genes undergo random changes, i.e., 
some parameters of the vectors born of the crossover are randomly modified. This step is 
essential to maintain the diversity of the population which in turn ensures the exploration 
of the whole optimization space. 

The vectors obtained after this third step are then used as initial population and the process 
is reiterated in order to get a new generation of genes and so on. The convergence of this 
algorithm to the global minimum of the problem is ensured by the fundamental theorem of 
genetic algorithms (Golberg 1989). An example of particularly efficient genetic algorithms is 
the Differential Evolutionary Genetic Algorithm by Price and Storn (1997) or the Dorsey and 
Mayer (1995) algorithm. 

As the the genetic algorithm isparticularly time consuming, we have turned to the Nelder 
and Mead (1965) simplex methoqj, that is a multidimensional unconstrained nonlinear mini- 
mization algorithm. This method presents however a serious disadvantage in our case : from 
the starting point chosen to initialize the procedure, it finds the nearest local minimizer of the 
function. In the case we are interested in, we know there are many local minima, thus an inad- 
equate choice of the initial value leads to a local minimum instead of the global minimum we 
are looking for. 

In order to bypass this problem, we have developed an iterative procedure hereafter called 
the "step by step" method, based on the Nelder Mead method, that has appeared very fast and 
efficient when the coefficients (a„, 6„) in (|49l) decay at least as fast as It consists in restrict- 
ing the optimization to g = 1, in a first step. In such a case, the optimization can be performed 

by the Nelder Mead method. It provides a first estimate ^ai(i), d;(i), W{i), o-(i), (i(i) j . In a 

second step, we set q = 2, and start the Nelder-Mead algorithm with the value ^ai(i), 0, &i(i), 0, 

d(i) , , , j which yields a new estimate (^ai(2) , 02(2) , ^1(2) , ^2(2) , "(2) , w>(2) , o-(2) , d{2)^ , 
and so on until the actual value of q is reached. 

"^see also Lagarias, Reeds, Wright, and Wright (1998) for a recent discussion of the convergence of the method. 
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The figure [23] illustrates the convergence of this procedure in the case of a numerical exper- 
iment that unfolds as follows : 

1 . we generate a reference density with a chosen q, 

2. we apply the two procedures using the Nelder-Mead method, 

3. we compare the accuracy of the two estimated densities to the reference density, 

4. step (1) and (2) are iterated one thousand times. 

The two graphs show the efficiency of the step by step method. On the left panel, the reference 
density has been drawn for q = 5 and we estimated the densities for q = 5 too. The best 
estimated density is irrevocably the one obtained by the iterative procedure. On the right panel, 
we account for the fact that the density should have an infinite number of parameters (see 
equation (|46|) ) or, at least, that the right order q in (|49|) is generally unknown, that is why we 
have generated a reference density with g = 10 and performed the estimation for q = 5 only 
to investigate the effect of the truncation on the accuracy of the two approaches. The result 
obtained for a randomly chosen simulation is displayed on the right panel of figure [23l One 
more time, the iterative method gives better results than global approach. 



[Insert figure\23\about here] 



While we have not been able to prove the convergence of the "step by step" procedure, our 
numerical simulations show that it provides estimates that are always close to the true parameter 
values (within the uncertainty predicted by proposition [3]). In addition, these estimates are 
almost always more accurate than the estimates obtained by use of the genetic algorithm, due 
to the very slow convergence of this algorithm. 
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Symbol 


Company 


Market 


Sector 


Cap. 


BKS 


Barnes & Noble Inc. 


NYSE 


dist. (c) 


1.62 


VLO 


Valero Energy corp (new) 


NYSE 


energy 


1.70 


DHI 


DR Horton inc. 


NYSE 


dist. (c) 


1.83 


LEN 


Lennar corp CL a common 

It 


NYSE 


dist. (c) 


1.85 


TCB 


TCF Financial Corp. 


NYSE 


financial 


1.92 


NYB 


New York Bancorp inc. 


NYSE 


financial 


2.17 


FNF 


Fidelity Natl Financial inc. 


NYSE 


financial 


2.27 


MCHP 


Microchip Technology inc. 


Nasdaq 


tech. 


2.99 


WPO 


Washington post co clb 


NYSE 


com. 


3.71 


GILD 


Gilead Science inc. 


Nasdaq 


dist. (nc) 


3.90 


GM 


General Motors co. 


NYSE 


dist. (c) 


31.81 


PG 


Procter & Gamble co. 


NYSE 


dist. (nc) 


84.30 


IBM 


Intel Business Machines corp. 


NYSE 


tech. 


108.01 


CSCO 


Cisco Systems inc. 


Nasdaq 


com. 


112.26 


MRK 


Merck & co inc. 


NYSE 


dist. (nc) 


112.73 


KO 


Coca-Cola co. 


NYSE 


dist. (nc) 


118.97 


AIG 


American intl group inc. 


NYSE 


financial 


125.58 


INTC 


Intel corp. 


Nasdaq 


tech. 


140.77 


WMT 


Wall-Mart Stores inc. 


NYSE 


dist. (c) 


144.61 


C 


Citigroup 


NYSE 


financial 


148.79 


PFE 


Pfizer inc. 


NYSE 


dist. (nc) 


198.20 


XOM 


Exxon mobile corporation 


NYSE 


energy 


199.19 


MSFT 


Microsoft corp. 


Nasdaq 


tech. 


239.81 


GE 


General Electrics co. 


NYSE 


industrial 


290.44 



Table 1: Average capitalization (in billion dollars) of every assets over the whole period 
(01/01/1994 to 12/31/2003) and their characteristics. 
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Asset 




d 


d 


gph 


d 


BKS 


0.19 


(0.04) 


0.36 


(0.02) 


0.44 


VLO 


0.23 


(0.04) 


0.32 


(0.02) 


0.44 


DHI 


0.34 


(0.04) 


0.34 


(0.02) 


0.50 


LEN 


0.31 


(0.05) 


0.34 


(0.02) 


0.53 


TCB 


0.31 


(0.04) 


0.35 


(0.02) 


0.43 


NYB 


0.00 


(0.06) 


0.33 


(0.02) 


0.37 


FNF 


0.25 


(0.04) 


0.34 


(0.02) 


0.28 


MCHP 


0.19 


(0.04) 


0.45 


(0.02) 


0.42 


WPO 


0.45 


(0.04) 


0.28 


(0.02) 


0.46 


GILD 


0.37 


(0.06) 


0.44 


(0.02) 


0.42 


GM 


-0.30 


(0.04) 


0.30 


(0.02) 


0.30 


PG 


0.22 


(0.05) 


0.41 


(0.02) 


0.53 


IBM 


0.21 


(0.05) 


0.36 


(0.02) 


0.44 


CSCO 


0.33 


(0.06) 


0.47 


(0.02) 


0.51 


MRK 


0.22 


(0.05) 


0.37 


(0.02) 


0.49 


KO 


0.16 


(0.05) 


0.41 


(0.02) 


0.55 


AIG 


0.10 


(0.03) 


0.44 


(0.02) 


0.48 


INTC 


0.23 


(0.06) 


0.38 


(0.02) 


0.53 


WMT 


0.31 


(0.04) 


0.42 


(0.02) 


0.46 


C 


-0.02 


(0.04) 


0.40 


(0.02) 


0.51 


PFE 


0.16 


(0.05) 


0.40 


(0.02) 


0.55 


XOM 


-0.09 


(0.04) 


0.41 


(0.02) 


0.49 


MSFT 


0.21 


(0.03) 


0.43 


(0.02) 


0.53 


GE 


0.04 


(0.06) 


0.44 


(0.02) 


0.50 



Table 2: Estimation of the long memory parameter, d, by our method and different semi- 
parameter methods (Geweke and Porter-Hudak (GPH) and Hurst) for all the assets, over the 
whole period (from 01/01/1994 to 12/31/2003). For the GPH estimator we also give the stan- 
dard deviation. 
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Whole 
period 


Before 
bubble burst 


After 
bubble burst 


csco 


d(a) 
w 

a 


0.33 (0.06) 

(0.12) 
(0.12) 


0.27 (0.04) 

(0.57) 
(0.80) 


0.18(0.04) 

(0.55) 
(0.55) 


MRK 


d(a) 
w 

a 


0.22 (0.05) 
(0.43) 
(0.31) 


0.30 (0.04) 
(0.43) 
(0.66) 


0.08 (0.04) 
(0.45) 
(0.39) 




a (^a) 
w 
a 


(0.46) 
(0.62) 


n 1 Q 0A\ 

yj. iy [yj.yj^ j 
(0.41) 
(0.66) 


(0.47) 
(0.41) 


AIG 


d(a) 
w 

a 


0.10(0.03) 
(0.64) 
(0.64) 


0.26 (0.05) 
(0.61) 
(0.45) 


-0.32 (0.04) 
(0.12) 
(0.85) 


MSFT 


d(a) 
w 

a 


0.21 (0.03) 
(0.66) 
(0.54) 


0.34 (0.04) 
(0.70) 
(0.50) 


0.00 (2.10^) 
(0.05) 
(0.30) 



Table 3: Estimation of the long memory parameter d (and its satandard deviation), the parameter 
a and the weight w by using the auto-correlations method for some large capitalizations, over 
the whole period (from 01/01/1994 to 12/31/2003). 
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Figure 1: Daily realized volatility of two assets from 01/01/1994 to 12/31/2003. On the left a middle capitalization : The Washington Post, 
and on the right a large capitalization : Coca Cola. 




Lag h Lag h 



Figure 2: Auto-correlation function of the realized volatility of two assets from 01/01/1994 to 12/31/2003. On the left a middle capitalization 
: The Washington Post, and on the right a large capitalization : Coca Cola. 




Figure 3: Density of log((Jt) and their approached normal law for different assets. On the left a middle capitalization : Microchip Technology 
inc., and on the right a large capitalization : Procter & Gamble co. 




Figure 4: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneity coefficient a for different values of a ranging in [—0.45, —0.05] and /? = 1.5. The law of is a Beta(— q;,1 + a) and its density 
equal to f{^p) = ^.J^-^^^^ — (p)" explains the divergence near and 1. 
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Figure 5: Auto-correlation function over 200 days for /3 = 1.5 and different values of a. 
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Figure 6: Auto-correlation function over 500 days for p = 5, g = 0.75, a = 0.8 and different 
values of cp' ranging in [0.60, 0.79]. 
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Figure 7: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for q = 0.75, a = 0.3 and different values of p ranging in [1.05, 8]. The law of is a Beta(p,g) extended over 
[—1, 1] and its density equal to f{ip) = ^p+g_j^, — r(l + (py^^{l — (f)^^^ explains the divergence near and 1. 




Figure 8: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for p = 5, a = 0.3 and different values of q ranging in [0.55, 0.85]. The law of is a Beta(p,g) extended over 
[—1, 1] and its density equal to f{ip) = ^p+g_j^, — r(l + (py^^{l — ^Y^^ explains the divergence near and 1. 
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Figure 9: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for p = 5, g = 0.75 and different values of a ranging in [0.1, 0.9]. The law of (y9 is a Beta(p,g) extended over 
[— 1, 1] and its density equal to = ^p+g_l^, — y(l + (py^^^^l — v?)*^^^ explains the divergence near and 1. 




Figure 10: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for p = 5, g = 0.75, a = 0.3, m = 0, a = 0.2 and different values of w ranging in [1/4, 3/4]. The density of the 

law of ip is equal to = 'w^^^^^rr^^^^l + ipy~^{l — (pY~^ + (1 — + V^)(l ~ exp ^— l ^*^^™-* j explains the divergence near 

and 1. 




Figure 1 1 : The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for p = 5, q = 0.75, a = 0.3, m = 0, w = 1/2 and different values of a ranging in [0.1, 0.3]. The density of the 

law of ip is equal to = ui ^p+^^l^^^^^ ^l + ipy~^{l — (pY~^ + (1 — + V^)(l ~ ^^P ) explains the divergence near 

and 1. 




Lag h iji 



Figure 12: The left panel and the right panel represent respectively the auto-correlation function over 200 days and the density of the 
heterogeneous coefficient for p = 5, g = 0.75, a = 0.3, a = 0.2, w = 1/3 and different values of m ranging in [—0.8, 0.8]. The density of 

the law of Lp is equal to f{{p) = w ^p+g_l^^^ ^^ (1 + {py~^{l — (p^'^ + (1 ~ '^)^{^ + '^){^ ~ v) 6xp (^—^ ^'^ JI'^ j explains the divergence 
near and 1 . 
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Figure 13: Estimation of the density with its 95 % confidence interval (on the left panel) and the auto-correlation function from lag to lag 
120 (on the right panel) for FNF. 





Figure 14: Estimation of the density with its 95 % confidence interval (on the left panel) and the auto-correlation function from lag to lag 
120 (on the right panel) for Microsoft. 
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Figure 15: On the left panel is presented the evolution of the price of Cisco Systems over the period considered in order to study the bubble 
burst effect. On the right panel, the auto-correlations over different periods are drawn. 




Figure 16: Density of the heterogeneity coefficient (on the left) and auto-correlation of the realized log-volatility of Cisco Systems (on the 
right) over the pre-bubble burst period. 




Figure 17: Density of the heterogeneity coefficient (on the left) and auto-correlation of the realized log- volatility of Cisco Systems (on the 
right) over the post-bubble burst period. 




Figure 18: Pre bubble burst density compared to post bubble burst density of Cisco Systems. 



58 



Over the whole period 




1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 20 40 60 80 100 120 

Year Lag h 



Figure 19: On the left panel is presented the evolution of the price of Coca Cola over the period considered in order to study the bubble burst 
effect. On the right panel, the auto-correlations over different periods are drawn. 




Figure 20: Density of the heterogeneity coefficient (on the left) and auto-correlation of the realized log-volatility of Coca Cola (on the right) 
over the pre-bubble burst period. 




Figure 21 : Density of the heterogeneity coefficient (on the left) and auto-correlation of the realized log- volatility of Coca Cola (on the right) 
over the post-bubble burst period. 
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Figure 22: Pre bubble burst density compared to post bubble burst density of Coca Cola. 
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Figure 23: Comparison of the goodness of the two Nelder Mead minimization algorithms. The estimated density is always obtained for 
g = 5. On the left panel, the reference density is calculated with q = 5 and on the right one with q = 10. 



